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Dkt. No. 7703/3 

METHODS AND COMPOSITIONS FOR AUTISM RISK ASSESSMENT 
Background 

(1) Field of the Invention 

5 The present invention generally relates to autism risk assessment. More 

specifically, methods and compositions are provided that are useful for estimating 
risk for having autism. 

(2) Description of the Related Art 
References cited 

10 1. Lord C, Leventhal BL, Cook EH Jr: Quantifying the phenotype in autism 
spectrum disorders. Am J Med Genet 2001; 105:36-38 

2. Lord C, Cook EH, Leventhal BL, Amaral DG: Autism spectrum disorders. 
Neuron 2000; 28:355-63 

3. Rapin I, Katzman R: Neurobiology of autism. Ann Neurol 1998; 43:7-14 
15 4. Folstein SE, Rosen-Sheidley B: Genetics of autism: complex aetiology for a 

heterogeneous disorder. Nat Rev Genet 2001; 2:943-955 

5. Rutter M: Concepts of autism: a review of research. J Child Psychol 
Psychiatry 1968; 9:1-25 

6. Bailey A, Le Couteur A, Gottesman I, Bolton P, Simonoff E, Yuzda E, Rutter 
20 M: Autism as a strongly genetic disorder: evidence from a British twin study. 

Psychol Med 1995; 25:63-77 

7. Lucinio J and Alvarado J: Progress in the genetics of autism. Mol Psychiatry 
2002; 7:1012-1017 
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9. Buxbaum JD, Silverman JM, Smith CJ, Greenberg DA, Kilifarski M, Reichert 
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5 polymorphisms and autism. Mol Psych 2002; 7:311-316 

10. Jamain S, Betancur C, Quach H, Philippe A, Fellous M, Giros B, Gillberg C, 
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with autism. Mol Psychiatry 2002; 7:302-310 
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10 Leventhal BL, Courchesne E, Cook EH Jr: Transmission disequilibrium mapping at 

the serotonin transporter gene (SLC6A4) region in autistic disorder. Mol Psychiatry 
2002; 7:278-288 

12. Veenstra-Vanderweele J, Cook E Jr, Lombroso PJ: Genetics of childhood 
disorders: XLVI. Autism, part 5: genetics of autism. J Am Acad Child Adolesc 

15 Psychiatry 2003; 42:116-118 

13. Buxbaum JD, Silverman JM, Smith CJ, Kilifarski M, Reichert J, Hollander E, 
Lawlor BA, Fitzgerald M, Greenberg DA, Davis KL: Evidence for a susceptibility 
gene for autism on chromosome 2 and for genetic heterogeneity. Am J Hum Genet 
2001; 68:1514-1520 

20 14. International Molecular Genetic Study of Autism Consortium (IMGSAC): A 
genomewide screen for autism: strong evidence for linkage to chromosomes 2q, 7q, 
and 16p. Am J Hum Genet 2001; 69:570-581 

15. Shao Y, Raiford KL, Wolpert CM, Cope HA, Ravan SA, Ashley-Koch AA, 
Abramson RK, Wright HH, DeLong RG, Gilbert JR, Cuccaro ML, Pericak-Vance MA: 

25 Phenotypic homogeneity provides increased support for linkage on chromosome 2 
in autistic disorder. Am J Hum Genet 2002; 70:1058-1061 

16. Bacchelli E, Blasi F, Biondolillo M, Lamb JA, Bonora E, Barnby G, Parr J, 
Beyer KS, Klauck SM, Poustka A, Bailey AJ, Monaco AP, Maestrini E: Screening of 
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nine candidate genes for autism on chromosome 2q reveals rare nonsynonymous 
variants in the cAMP-GEFII gene; Mol Psychiatry 2003; 8:916-24 

17. Weiss LA, Escayg A, Kearney JA, Trudeau M, MacDonald BT, Mori M, 
Reichert J, The AGRE Consortium, Buxbaum JD, Meisler MH: Sodium channels 

5 SCN1A, SCN2A and SCN3A in familial autism. Mol Psychiatry 2002; 8:182-190 

18. Geschwind DH, Sowinski J, Lord C, Iversen P, Shestack J, Jones P, Ducat L, 
Spence SJ: The autism genetic resource exchange: a resource for the study of 
autism and related neuropsychiatric conditions. Am J Hum Genet 2001; 69:463- 
466 

10 19. Lord C, Rutter M, Le Couteur A: Autism Diagnostic Interview-Revised: a 
revised version of a diagnostic interview for caregivers of individuals with possible 
pervasive developmental disorders. J Autism Dev Disord. 1994; 24:659-685 

20. Olivier M, Chuang LM, Chang MS, Chen YT, Pei D, Ranade K, de Witte A, 
Allen J, Tran N, Curb D, Pratt R, Neefs H, de Arruda Indig M, Law S, Neri B, Wang 

15 L, Cox DR: High-throughput genotyping of single nucleotide polymorphisms using 
new biplex invader technology. Nucleic Acids Res 2002; 30:e53 

21. Spielman RS, Ewens WJ: A sibship test for linkage in the presence of 
association: the sib transmission/disequilibrium test. Am J Hum Genet 1998; 
62:450-458 

20 22. Durner M, Vieland VJ, Greenberg DA: Further evidence for the increased 
power of LOD scores compared with nonparametric methods. Am J Hum Genet 
1999; 64:281-289 

23. Hodge SE, Vieland VJ, Greenberg DA: HLODs remain powerful tools for 
detection of linkage in the presence of genetic heterogeneity. Am J Hum Genet 

25 2002; 70:556-559 
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25. del Arco A, Satrustegui J: Molecular cloning of Aralar, a new member of the 
mitochondrial carrier superfamily that binds calcium and is present in human 
muscle and brain. J Biol Chem 1998; 273:23327-23334 

26. Palmieri L, Pardo B, Lasorsa FM, del Arco A, Kobayashi K, Iijima M, 

5 Runswick MJ, Walker JE, Saheki T, Satrustegui J, Palmieri F: Citrin and aralarl are 
Ca(2+)-stimulated aspartate/glutamate transporters in mitochondria. EMBO J 
2001; 20:5060-5069 

27. Ramos M, del Arco A, Pardo B, Martinez-Serrano A, Martinez-Morales JR, 
Kobayashi K, Yasuda T, Bogonez E, Bovolenta P, Saheki T, Satrustegui J: 

10 Developmental changes in the Ca2+ -regulated mitochondrial aspartate-glutamate 
carrier aralarl in brain and prominent expression in the spinal cord. Dev Br 
Research 2003; 143:33-46 

28. Filipek PA, Juranek J, Smith M, Mays LZ, Ramos ER, Bocian M, Masser-Frye 
D, Laulhere TM, Modahl C, Spence MA, Gargus JJ: Mitochondrial dysfunction in 

15 autistic patients with 15q inverted duplication. Ann Neurol 2003; 53:801-804 
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20 Genet 2003; 33:(Suppl)228-237 
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S, Tysk C, O'Morain CA, Gassull M, Binder V, Finkel Y, Cortot A, Modigliani R, 
Laurent-Puig P, Gower-Rousseau C, Macry J, Colombel J-F, Sahbatou M, Thomas 
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25 disease. Nature 2001; 411:599-603 
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Baier LJ, Bogardus C, Groop L, Boerwinkle E, Hanis CL, Bell GI: Genetic variation 
in the gene encoding calpain-10 is associated with type 2 diabetes mellitus. Nat 
Genet 2000; 26:163-75 

33. Yan H, Yuan W, Velculescu VE, Vogelstein B, Kinzler KW: Allelic variation in 
5 human gene expression. Science 2002; 297:1143-5 

34. Bray NJ, Buckland PR, Owen MJ, O'Donovan MC: Cis-acting variation in the 
expression of a high proportion of genes in human brain. Hum Genet 2003; 149-53 

35. Lasorsa FM, Pinton P, Palmieri L, Fiermonte G, Rizzuto R, Palmieri F: 
Recombinant expression of the Ca(2+)-sensitive aspartate/glutamate carrier 

10 increases mitochondrial ATP production in agonist-stimulated Chinese hamster 
ovary cells. J Biol Chem 2003; 278:38686-92 

36. Nyholt DR. All LODs are not created equal. Am J Hum Genet 200:282-288. 

Autism or autistic disorder (MIM #209850) is a neurodevelopmental 
disorder characterized by a deficit in verbal and non-verbal communication, 

15 impairments in reciprocal social interactions, and patterns of repetitive or 

stereotyped behaviors and interests (1-3). The sex-ratio is 4:1 male to female, and 
the prevalence of the disease is currently thought to possibly be above 1 per 1000 
persons (4). Autism appears to be the most highly genetic of the psychiatric 
disorders as evidenced by the high risk of autism in additional children in families 

20 with an autistic child (estimated to be 50 to 100 times greater than that expected 
by chance) and the concordance rate for monozygotic twins being much higher 
than that of dizygotic twins (5). Heritability estimates of idiopathic autism are 
above 90% (6), so much of the disorder can be attributed to a genetic etiology. 
However, autism does not follow a simple Mendelian mode of transmission (i.e., 

25 dominant or recessive transmission) but is clearly a polygenic disease (4), A 

commonly accepted genetic model involves several genes (between 5 to 10) that 
interact to produce the disorder. 
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A genetic mutation or variant segregating with autism has yet to be 
unequivocally identified. Candidate genes for studies of autism range from genes 
that are thought to play a role in neurodevelopmental pathways, comportment or 
behavior, such as genes in the serotonergic pathway or reelin (4, 7, 8). A few 
5 polymorphisms in several genes have been associated with the disorder in certain 
studies, but not in others (4, 9-11). 

Several independent studies involving genome-wide scans have now been 
published and point to significant linkage between autism and chromosome 2q and 
7q regions (4, 12). Our studies defined chromosome 2q24-q33 as a susceptibility 

10 region for autism with a peak at D2S335, particularly evident in families with more 
severe autism [as defined by delayed onset (over 36 months) of phrase speech 
(phrase speech delay, PSD) (MIM #606053)] (NPL score of 3.32 and a HLOD of 
2.99) (13). Using a cohort of 152 autism sibling-pair families mostly from 
European countries, the International Molecular Genetic Study of Autism 

15 Consortium (IMGSAC) reported their highest multipoint LOD score (MLS=3.74) at 
D2S2188 in families with autism with language delay (defined in that study as no 
single word before 24 months and/or no phrase speech before 33 months) (14). 
When stricter diagnostic criteria were used, the MLS increased to a value of 4.8. 
Finally, a study from the Collaborative Autism Team using the PSD criteria to 

20 weight its data also showed a linkage between autism and chromosome region 
2q33, with an MLS of 2.86 and a HLOD of 2.12 at D2S116 (15). 

D2S335, D2S2188 and D2S116 are localized on chromosome 2 at 171 
megabases (Mb), 174.4 Mb and 200.5 Mb, respectively (Figure 1A). This indicates 
that a critical region of susceptibility for autism occurs near D2S335 and D2S2188 

25 in 2q31. In this interval, several known genes and expressed sequence tags have 
been mapped (Figure IB). Recently, the IMGSAC has reported the analysis of nine 
candidate genes (TBR1, GAD1, DLX1, DLX2, cAMP-GEFII, CHN1, CREB2, HOXD1 
and NEUROD1) localized across a 30 Mb region of 2q, that are expressed in the 
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central nervous system and encode proteins that play a role in neuronal cells or in 
neurobiological pathways (16). Variants were observed in TBR1, cAMP-GEFII, 
CHN1, HOXD1 and NEUROD1. However, no evidence was found that any of the 
candidate genes contributes to autism. In this region, our laboratory, together with 
5 the laboratory of Dr. Miriam Meisler, has previously investigated the neuronal 
voltage-gated sodium channels type I, II and III (SCN1A, SCN2A and SCN3A) in 
117 multiplex autism families (17). Rare mutations were identified, each in single 
families, that were not observed in controls. These mutations, while of great 
interest, are not likely to account for the evidence of linkage observed in this 
10 region. 

There is thus a need to pinpoint genetic variations associated with autism, in 
order to be able to determine whether individuals are at particular risk for autism, 
and to determine the risk for autism in offspring of two individuals. Identification 
of genetic risk factors would also provide tools and information for elucidating the 
15 causes of autism. The present invention addresses that need. 

Summary of the Invention 

Accordingly, the inventors have discovered that the risk of autism is 
increased in individuals having the G allele at either or both polymorphism sites 
rs2056202 and rs2292813 of the SLC25A12 gene. 

20 Thus, in some embodiments, the invention is directed to method of 

evaluating an individual for relative genetic risk for autism. The methods comprise 
determining the individual's genotype at polymorphism sites rs2056202 and/or 
rs2292813 of the SLC25A12 gene. In these embodiments, the presence of a G at 
either of the two sites indicates an increased risk for autism, and the presence of an 

25 increasing number of G's at the sites indicates an increasing risk for autism. 

The invention is also directed to sets of two primers suitable for use in 
polymerase chain reaction, useful for the methods identified immediately above. 
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The invention is additionally directed to kits comprising the above-identified 
primers. 

In other embodiments, the invention is directed to polynucleotides 
consisting of any of the sequences of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 
5 and SEQ ID NO:4. 

The invention is further directed to methods of identifying a form of a 
genetic polymorphism that is linked to autism. The methods comprise identifying a 
polymorphism in the SLG25A12 gene and determining whether one form of the 
polymorphism is present in autistic individuals more than another form. In these 
10 embodiments, in the form that is present more often in autism is linked to autism. 
The polymorphisms identified by these methods can also be used to determine the 
risk of an individual to autism. 

In further embodiments, the invention is directed to eukaryotic cells 
comprising a transgenic human SLC25A12 gene, and non-human animals 
15 comprising those cells. 

Additionally, the invention is directed to methods of evaluating whether a 
compound affects autism. The methods comprise contacting the compound with 
the above described eukaryotic cell, then determining whether the compound 
affects expression or activity of a product of the SLC25A12 gene. In these 
20 embodiments, a compound that affects expression or activity of the product of the 
SLC25A12 gene affects autism. 

Brief Description of the Drawings 

FIG. 1 is three graphics showing the genomic organization of the autism 
susceptibility locus on chromosome region 2q24-q33. Panel A shows the genetic 
25 and cytogenetic mapping. Panel B shows the organization of positional candidate 
genes. Arrowhead indicates orientation of transcription. Panel C shows the genomic 
structure of SLC25A12 gene. Variants identified in the present study are indicated 

257727.1 



-9- 

(see text), with the two SNPS focused on in the current study, rs2056202 (13- 
21A/G) and rs2292813 (I16+70A/G) underlined. 

FIG. 2 is a table showing the results of a relative risk assessment using 
polymorphisms of the SLC25A12 gene. 



5 Detailed Description of the Invention 

The present invention is based on the discovery of an association between 
certain polymorphisms in the SLC25A12 gene and autism. This discovery enables 
various compositions and methods for use in various aspects of autism diagnostics, 
therapeutics, and research. 

10 Thus, in some embodiments, the invention is directed to method of 

evaluating an individual for relative genetic risk for autism. The methods comprise 
determining the individual's genotype at polymorphism sites rs2056202 and/or 
rs2292813 of the SLC25A12 gene. In these embodiments, the presence of a G at 
either of the two sites indicates an increased risk for autism, and the presence of an 

15 increasing number of G's at the sites indicates an increasing risk for autism. 

Any human can be tested using these embodiments, and these methods can 
also be used to determine the potential risk of the offspring of two individuals for 
autism, based on the identified genotype of the parents at the relevant 
polymorphism. 

20 The invention is not narrowly limited to any particular method for 

determining the individual's genotype at the relevant polymorphic sites; the skilled 
artisan could chose and apply an appropriate method for any particular application 
without undue experimentation. In some preferred embodiments, the genotype is 
determined by single strand conformation polymorphism, denaturing high- 

25 performance liquid chromatography, DNA Invader, and/or polymerase chain 
reaction amplification followed by sequencing. See, e.g., the Example. 
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In those embodiments employing polymerase chain reaction (PCR), 
preferred primers for amplifying the appropriate regions are comprise SEQ ID NO:5 
and SEQ ID NO:6 (for amplifying the relevant region of rs2056202 [these primers 
amplify SEQ ID NO:l or SEQ ID NO:2, depending on the polymorphic form 
5 present]) or SEQ ID NO:7 and SEQ ID NO:8 (for amplifying the relevant region of 
rs2292813 [amplifying SEQ ID NO:3 or SEQ ID NO:4]). 

The invention is also directed to sets of two primers suitable for use in 
polymerase chain reaction, useful for the methods of determining an individual's 
risk for autism, as described above. Preferred primer sets are SEQ ID NO:5 and 

10 SEQ ID NO:6, for amplifying the relevant region of rs2056202, and SEQ ID NO:7 
and SEQ ID NO:8, for amplifying the relevant region of rs2292813. However, any 
other primers that are specific for either of the relevant regions of the SLC25A12 
gene could also be used, and can be designed without undue experimentation. 
In additional embodiments, the invention is directed to kits comprising at 

15 least one set of primers suitable for use in polymerase chain reaction (PCR). The 
set of primers in these kits amplifies polymorphism site rs2056202 or rs2292813, 
or both of the SLC25A12 gene. Consistent with the above discussion describing the 
relevant primers, preferred primer sets are SEQ ID NO:5 and SEQ ID NO:6, for 
amplifying the relevant region of rs2056202, and SEQ ID NO:7 and SEQ ID NO:8, 

20 for amplifying the relevant region of rs2292813. However, any other primers that 
are specific for either of the relevant regions of the SLC25A12 gene could also be 
used. These kits can also comprise instructions for using the set(s) of primers to 
evaluate an individual for relative genetic risk for autism by determining the 
genotype of the polymorphic sites re2056202 and/or re2292813 of the SLC25A12 

25 gene. The kits can also comprise other ingredients such as buffers, enzymes and 
the like, for performing PCR and/or analysis of the PCR product(s). 
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The invention is additionally directed to PCR products amplified using any 
of the above-described primer sets. Nonlimiting examples of these PCR products 
include SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. 

The invention is further directed to methods of identifying a form of a 
5 genetic polymorphism that is linked to autism. The methods comprise identifying a 
polymorphism in the SLC25A12 gene and determining whether one form of ±e 
polymorphism is present in autistic individuals more than another form. In these 
embodiments, in the form that is present more often in autism is linked to autism. 
The polymorphisms identified by these methods can also be used to determine the 
10 risk of an individual to autism. Nonlimiting examples of procedures to conduct 
these methods are provided in the Example. Genetic polymorphisms identified by 
the above procedure that are associated with autism can also be used to evaluate 
an individual for relative genetic risk for autism. 

Since this invention includes the discovery that the SLC25A12 gene is 
15 associated with autism, the skilled artisan would understand that cells and 
multicellular organisms comprising the human SLC25A12 gene are useful for 
autism research. 

Thus, the instant invention is also directed to eukaryotic cells comprising a 
transgenic human SLC25A12 gene. In some embodiments, these eukaryotic cells 
20 are particularly useful when the transgenic SLC25A12 gene comprises the sequence 
of SEQ ID NO:2 and/or SEQ ID NO:4, since those sequences are within SLC25A12 
genes associated with autism. 

In these embodiments, the cell is a yeast cell, or a mammalian cell, such as a 
brain cell. The cell can also be within a living mammal, such as a transgenic 
25 mammal transfected with the gene. 

Thus, in related embodiments, the invention is directed to non-human 
animals comprising the above described eukaryotic cells. Preferably, the non- 
human animal of these embodiments is a mammal. 
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Since it is likely that the SLC25A12 gene product is altered in autism, a 
mitochondrial aspartate/glutamate carrier (AGO) (see Example), chemical 
compounds that affect AGC1 would be expected to affect autism. 

Thus, the invention is also directed to methods of evaluating whether a 
5 compound affects autism. The methods comprise contacting the compound with 
any of the above eukaryotic cells and determining whether the compound affects 
expression or activity of a product of the SLC25A12 gene. In these embodiments, a 
compound that affects expression or activity of the product of the SLC25A12 gene 
affects autism. The compounds useful for ±ese embodiments can be a small 
10 organic or inorganic molecule or a macromolecule such as an antibody, an 
aptamer, an siRNA, an antisense compound, etc. 

Preferred embodiments of the invention are described in the following 
examples. Other embodiments within the scope of the claims herein will be 
apparent to one skilled in the art from consideration of the specification or practice 
15 of the invention as disclosed herein. It is intended that the specification, together 
with the examples, be considered exemplary only, with the scope and spirit of the 
invention being indicated by the claims which follow the examples. 

Example 1. 
Example summary 

20 Objective: Autism/autistic disorder (MIM #209850) is a complex, largely 

genetic psychiatric disorder. We recently mapped a susceptibility locus for autism 
to chromosome region 2q24-q33 (MIM #606053). In the present study, we 
analyzed genes across the 2q24-q33 interval to identify an autism susceptibility 
gene in this region. 

25 Method : Mutation screening of positional candidate genes was performed in 

two stages. The first stage involved identifying genetic variants in exons and 
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flanking sequence within candidate genes, in unrelated subjects showing linkage to 
2q24-q33, and comparing the frequency of the variants between subjects and 
controls. Two single nucleotide polymorphisms (SNPs) that showed evidence for 
divergent distribution between subjects and controls were identified, both within 
5 SLC25A12, a gene encoding the mitochondrial aspartate/glutamate carrier (AGC1). 
In the second stage, the two SNPs in SLC25A12 were further genotyped in 411 
autistic families, and linkage and association tests were carried out in the 197 
informative families. 

Results: Linkage and association were observed between autistic disorder 

10 and the two SNPs, rs2056202 and rs2292813, found in SLC25A12. Using a single 
affected per family, evidence for excess transmission was found by the transmission 
disequilibrium test (TDT) for rs2056202 (% 2 = 10.83, df=l, P=0.001), rs2292813 
(X 2 =6.23, df=l, P=0.01), and a two-locus G*G haplotype (/ 2 =22.10, df=l, 
P= 0.000003). Using multiple affected individuals per family demonstrated 

15 evidence for linkage by the TDT for rs2056202 Q( 2 =8.89, df=l, P=0.003) and 
rs2292813 (X 2 =7.28, df=l, P=0.007), and for the two-locus haplotype 
(X 2 =20.41, df=l, P=0.000006). Evidence for linkage was supported by linkage 
analysis with the two SNPs, with a maximal multipoint NPL score of 1.64 and a 
maximal multipoint heterogeneity LOD score of 2.28. 

20 Conclusions: Our studies demonstrated strong association of SNPs within the 

SLC25A12 gene with autism. 
Introduction 

The aim of the present study was to identify an autism susceptibility gene in 
the 2q31 region. A systematic screen for genetic variants in affected individuals 
25 identified linkage and association between autism and single nucleotide 
polymorphisms in the SLC25A12 gene. 
Methods 
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Subiects. A total of four hundred eleven families were either recruited by 
the Seaver Autism Research Center (SARC) at Mount Sinai (n=40), co-recruited by 
SARC and the Autism Genetic Research Exchange (AGRE) (18) (n=127), or 
recruited by AGRE (n=244). All parents signed an informed consent and 
5 potentially affected individuals were assessed by the Autism Diagnostic Interview- 
Revised (ADI-R) (19). Individuals meeting ADI-R criteria for autism (19) or 
borderline autism (13) were defined as affected. The cohort (18) and research 
diagnosis definitions used in the current study (13) have been described. The entire 
cohort of 411 families included blood samples from more than 2000 individuals, 

10 including 720 affected individuals (671 with autism and 49 with borderline 

autism), and available parents and sibs. The 411 families included 274 multiplex 
(typically affected sibling-pairs) and 137 trio families. DNA from blood samples or 
transformed cells were either isolated as detailed in Buxbaum et al., 2001, or 
provided by the AGRE repository. 

15 Mutation screening. To investigate the involvement of positional candidate 

genes in autism, we performed a two-stage screen. 

In the first stage, exonic and flanking DNA from 35-47 patients from 38 
autistic families linked to the chromosome 2q24-q33 region were screened for 
genetic variants by single strand conformation polymorphism (SSCP) and 

20 denaturing high-performance liquid chromatography (DHPLC). The rationale for 
this approach was to find variants in linked, affected individuals, rather then rely 
on variants in public databases identified in unaffected individuals. Furthermore, 
the focus was on exonic sequence and intronic sequence adjacent to exons, as being 
most likely to harbor functionally important variants. 

25 Primers to amplify exons and flanking regions of positional candidate genes 

were designed using primer3 software (http://www-genome.wi.mit.edu/cgi- 
bin/primer/primer3_www.cgi) and are available on request. The forward and 
reverse primers were chosen to produce amplicons of less than 500 bp. Larger 
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exons, in particular the last exons of the genes, were divided over several 
amplicons, each with a size of less than 500 base pairs. Primers within flanking 
introns were situated 26 to 186 bp from the exons. A total of 82 exons of 9 genes 
were analyzed, using 93 amplicons with an average size of ca. 310 nucleotides. 
5 For SSCP, forward primers were labeled using 100 \iCi [yP32]-ATP and 10 

units of T4 polynucleotide kinase, according to the protocol provided by Invitrogen. 
Amplification was carried out using Ampli-Taq Gold Polymerase (Applied 
Biosystems) in a final volume of 10 consisting of 10 ng of genomic DNA, 10 mM 
Tris-HCl, 50 mM KC1, 2.5mM MgCl, 100 \iM dNTPs, 10 |iM radioactively-labeled 
10 forward and unlabeled reverse primers, and 0.5 units of Taq Polymerase. Two |il of 
radioactively-labeled PCR product was then mixed with 2 (il of formamide blue 
loading buffer, denatured and separated on a non-denaturing MDE 0.5X gel 
(according to the protocol of BioWhittaker). SSCP were detected by 
autoradiography. 

15 For DHPLC, 8 (A of PCR product was screened on the WAVE Nucleic Acid 

Fragment Analysis System (according to the manufacturer's protocols). 
Chromatographic parameters, appropriate analysis temperatures, and melting 
domains visualization were determined by WAVEMAKER software. Samples were 
using at least two mobile-phase temperatures to maximize the chances to identify 

20 polymorphisms. 

Any amplicons in which variants were detected by either SSCP or DHPLC 
were then sequenced by direct fluorescent sequencing of purified PCR products 
using the BIG DYE dideoxy-terminator kit v3.1 (Applied Biosystems) and an 
ABI3100 DNA sequencer with Performance Optimized Polymer 6 (Applied 

25 , Biosystems). Variants were then genotyped in 38 unrelated cases and in 100 
controls. Allele frequencies were compared to identify variants with potentially 
altered frequencies (defined as P=0.1) between ethnically matched cases and 
controls (to reduce the chances for false-positives due to stratification). 
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In the second stage, variants with frequencies that were potentially altered 
between cases and controls were then analyzed in the entire cohort of more than 
2000 individuals from 411 autistic families. Genotyping was carried out using the 
biplex DNA Invader method (Third Wave Technologies). The Invader assay is based 
5 on the hybridization of an oligonucleotide probe that completely matches a DNA 
target and the subsequent cleavage of the overlapping structure by Cleavase VIII, 
resulting in a target-specific product that is recognized by a fluorescence resonance 
energy transfer (FRET) cassette (20). Specific oligonucleotide probes 
corresponding to wild type and mutated SNPs are associated with specific 

10 fluorophores, enabling simultaneous detection of both DNA sequences in a single 
well. For our studies, diluted aliquots of the PCR products were combined with 
Invader mix solution (Third Wave Technologies), and incubated for the cleavage 
reaction in a thermal cycler (PTC-100, MJ Research). The reaction product was 
then analyzed on a fluorescence plate reader (CytoFluor Series 4000, PerSeptive 

15 Biosystems) using the appropriate parameters of excitation and emission for each 
fluorophore. 

Statistical analysis. Two-tailed chi-square tests (x 2 ) were used for 
comparisons of allelic frequencies and distributions of genotypes between control 
and autism groups. Fischer's exact test was performed when the number in a group 

20 was less than five. Hardy- Weinberg distribution was examined for each identified 
polymorphism in autistic and control groups. 

Statistical analyses for transmission disequilibrium tests (TDT) were 
computed with TDT-GENEHUNTER (GENEHUNTER version 2.1, compiled to run 
on the Unix environment of Mac OS X) and the S-TDT (21) program 

25 (http://genomics.med.upenn.edu/spielman/TDT.htm). Two-locus TDT was carried 
out with TDT-GENEHUNTER using the TDT2 option and the haplotypes were 
constructed by GENEHUNTER and verified manually to ensure structure. 
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Haplotypes were determined on the basis of transmission patterns in families in 
which both parents were genotyped. 

For linkage analysis, we used GENEHUNTER PLUS (compiled to run on the 
Unix environment of Mac OS X). Scores for heterogeneity LOD (HLOD) and 
5 nonparametric linkage (NPL) were calculated for both single and multi-point 
analyses. For HLOD, data were analyzed under both a dominant and recessive 
model, using 50% penetrance and a value of 0.001 for the disease allele frequency. 
Such an approach detects linkage under many different conditions irrespective of 
the "true" underlying inheritance pattern (22, 23). 
10 Linkage disequilibrium (LD) was estimated using a D' value calculated with 

the 2LD program (24). 
Results 

Genes across the 2q31 region were screened for association in two stages as 
detailed in the Methods. The genes analyzed included glutamate decarboxylase 1 

15 (GAD1) (in collaboration with Drs. Shigeo Kure, Kiyoshi Kanno and Yoichi 

Matsubara), four hypothetical proteins (FLJ13096, FU13984, LOC130672, and 
FU23462, recently identified as duodenal cytochrome b) (in collaboration with 
Drs. Paolo Gasparini and Massimo Carella), histone acetylase-1 (HAT-1) (in 
collaboration with Dr. Salah Uddin Qureshi), the cytoplasmic dynein subunit 

20 DNCI2, the asparate/glutamate carrier SLC25A12, and the homeobox protein DLX2 
(Figure 1C). These candidate genes were chosen based on their position relative to 
the positive linkage results from three studies (13-15), their expression in brain 
tissue, and, in some cases, their known function, their novelty, or the existence of 
related genes within the region of chromosome 7 showing linkage to autism. For 

25 this latter criterion, we note that the linked region of chromosome 7 contains genes 
paralogous to DNCI2, SLC25A12, DLX1 and DLX2 (i.e., DNCI1, SLC25A13, DLX5 
and DLX6). 
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In the first stage, all known exons (with flanking intronic sequence) of these 
genes were screened by SSCP and DHPLC for variants in 35 to 47 unrelated 
individuals chosen from families showing linkage to D2S335 as described in 
Methods. In the nine genes, 82 exons were screened and 29 SNPs were identified. 
5 Frequencies of each variant were then evaluated in the autistic patients (using only 
one affected individual per family, n=38) and in 50 ethnically matched controls, 
after confirming that the distribution of allele frequencies were in Hardy- Weinberg 
equilibrium. Only two SNPs, both within the SLC25A12 gene, showed significant 
differences in allele frequencies between cases and controls using both allele- 

10 (P< 0.004) and genotype-based (P<0.03) tests. 

Within the SLC25A12 gene, we identified a total of five variants in the first 
stage screen (including the two meeting criteria for further study, indicated above). 
Figure 1C presents the five variants of the SLC25A12 gene identified in 47 affected 
subjects linked to the chromosome 2q24-q33 region. The two polymorphism 

15 meeting criteria in the first stage, rs2056202 (I3-21A/G) and rs2292813 
(I16+70A/G), are G/A variants in flanking intronic sequence located 21 bp 
upstream of exon 4 and 70 bp downstream of exon 16, respectively. Two variants, 
C->T at nucleotide 99 (rsl878583) and G->A at nucleotide 1418, were within 
coding regions. G1418A changes arginine 473 to glutamine, while the C99T variant 

20 is silent. G1418A is a new SNP, not reported in the NCBI dbSNPs database, located 
in a region conserved across mammalian species, but the amino acid glutamine is 
observed in mice. The final variant appears in the 3' untranslated region (UTR). We 
did not find SNP rsl059299, reported in the public database, which changes amino 
acid 600, in our sample. 

25 Given the evidence for association of rs2056202 and rs2292813 in a small 

number of cases and controls, the entire sample was genotyped at these SNPs for 
analysis by the Transmission Disequilibrium Test (TDT), which makes use of 
family-based controls. Of the 411 families studied, 197 had at least one parent 
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heterozygous for at least one SNP. These families included 140 multiplex and 57 
singleton families. To test for association by the TDT, transmission from 
heterozygous parents to one affected child was analyzed (Table 1). TDT analysis 
demonstrated association for rs2056202 (% 2 = 10.83, df=l, P=0.001) and for 
5 rs2292813 (X 2 =6.23, df=l, P=0.01). In both cases, the G allele appeared to be the 
risk allele (or the A allele the protective allele) (for simplicity, Table 1 and 2 show 
transmission data for just the G allele for both SNPs). 
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TDT analysis was also carried out using multiple affected individuals per 
family (Table 2). Such analysis is more properly a measure of linkage rather than 
association. Transmission disequilibrium was observed for both rs2056202 
(X 2 =8.89, df=l, P=0.003) and rs2292813 (X 2 =7.28, df=l, P=0.007). 



5 Table 2. TDT with all affecteds per family 



NT 



X 



SNPs 

rs2056202 191 

rs2292813 124 
Haplotype 

10- G*G 163 

G*A 11 

A*G 44 

A* A 55 



137 
85 

91 
16 
71 
95 



8.89 
7.28 



0.003 
0.007 



20.41 6X10" 6 

0.93 0.34 

6.34 0.011 

10.67 0.001 



For each SNP or haplotype, the number of transmitted (T) or non-transmitted 
15 (NT) events are shown. For the individual SNPs, data is for the G allele. 



Looking at haplotypes, there was an increased transmission of the G*G 
haplotype in autism when analyzing either one affected per family (x 2 =22.10, 
df=l, P=0.000003) or all affecteds (x 2 =20.41, df=l, P=0.0000006). Using a 
global analysis, two-locus TDT showed disequilibrium of transmission of the four 
20 observed haplotypes for both one affected per family (x 2 =32.31, df=3, P=5X10" 7 ) 
or all affecteds (x 2 =28.76, df=3, P=0.000003). 

Two-point linkage analysis using nonparametric LOD score analysis (NPL), 
indicated some evidence for linkage between autism and rs2056202 (NPL=1.26, 
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P=0.07) or rs2292813 (NPL=1.1, P=0.09) (Table 3). Two-point heterogeneity 
LOD (HLOD) supported this linkage, with maximal HLOD scores of 2.11 (P=0.03) 
and 1.15 (P=0.1), for rs2056202 and rs2292813, respectively. However, 
information was low at these SNPs (estimated as 0.23 and 0.34 for rs2056202 and 
5 rs2292813, respectively). To increase information we used multipoint linkage 
analyses with these two SNPs. Under these conditions, maximal multipoint NPL 
scores of 1.57 and maximal multipoint HLOD scores of 2.11 were observed 
(information was increased to about 0.51). The two markers showed linkage 
disequilibrium with each other as determined by analyzing linkage disequilibrium 
10 in unrelated patients (D'=0.79, SD=0.06). 
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A genotype relative risk was also estimated for individuals carrying one or 
two copies of the risk alleles (the G alleles in both cases) for either rs2056202 or 
rs2292813. Results are given in FIG. 2. Those results show that the relative risk 
increases with increasing G alleles at either polymorphic site. It should also be 
5 noted that genotype relative risk estimates in TDT studies such as this 

underestimate the true genotype relative risk. See Risch, Theoret. Pop. Biol. 60, 
215-220 (2001). Therefore, the true autism risk for the genotypes indicated in FIG. 
2 is likely to be considerably higher than indicated. 

Discussion 

10 The objective of the present work was to identify a susceptibility gene for 

autism in chromosome region 2q24-q33. We identified two SNPs, rs2056202 and 
rs2292813, in SLC25A12 that demonstrated association with autism using the TDT. 
Preferential transmission of the G allele for both SNPs was found in 197 
informative families. Furthermore, linkage was found between autism and the SNPs 

15 by TDT and by non-parametric and parametric analyses. 

The SLC25A12 gene contains 18 exons spreading over about 110 kilobases 
(kb) and is expressed primarily as 2.9 and 3.2kb mRNA species, predominantly in 
skeletal muscle, heart and brain (25). SLC25A12 cDNA has an open reading frame 
of 2037 bp encoding a 678 amino acids protein that is a calcium-dependent 

20 mitochondrial aspartate/glutamate carrier (AGC1). The subcellular localization of 
protein is exclusively mitochondrial. The amino-terminal half of AGC1 contains five 
putative EF hands that are able to bind Ca2+, while the carboxy-terminal half 
harbors the aspartate/glutamate exchanger function. AGC1 is critically involved in 
the activity of the malate/aspartate NADH shuttle, catalyzing the electrogenic 

25 exchange of aspartate for glutamate and a proton, as well as in the urea cycle (26). 
Recently, it has been shown that the SLC25A12 gene, as well as AGC1, is the only 
form of the mitochondrial aspartate/glutamate carrier expressed in neurons and 
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neural stem cells (27). Protein levels increased during neuronal differentiation and 
are correlated with an increase in the malate/aspartate NADH shuttle activity. 

SLC25A12 mRNA and the AGC1 protein are widely expressed in adult 
mouse CNS, particularly in neural nuclei in the brainstem (27). It has been 
5 suggested that the enrichments of AGC1 in specific neurons could reflect a tonic 
activity of these neurons. Dysfunction of this protein, or altered expression of this 
protein, may lead to an alteration in mitochondrial function and ATP synthesis. As 
neurons are major energy users, even modest changes in mitochondrial function 
and ATP synthesis may lead to selective changes in neurons. Support for a role for 

10 mitochondrial dysfunction in autism comes from a recent study demonstrating 

mitochondrial hyperproliferation and partial respiratory chain block in two patients 
with autism and a 15q inverted duplication (28). 

As noted above, a region of chromosome 7 has also been shown to be linked 
to autism in multiple studies (29). It is interesting that a paralog of SLC25A12, 

15 namely SLC25A13 (CITRIN1 or AGC2), localizes to this region of chromosome 7. 
These two genes share about 79% identity, both encoding forms of 
aspartate/glutamate carriers, with 71% identity in the EF-hand domains and 84% 
within the exchanger domain. Mutations in SLC25A13 gene confers adult-onset 
type II citrullinemia (CTLN2, MIM#603471), an autosomal recessive disease 

20 caused by a deficiency of argino-succinate synthetase with clinical features 

included enuresis diarrhea, tremors, lethargy, mental retardation and psychiatric 
disorders. Although confirmation of the association between autism and SLC25A12 
is required, it is tempting to speculate that SLC25A13 could be a candidate autism 
susceptibility gene on chromosome 7. 

25 For both SNPs, the allele associated with autism corresponds to a common 

allele. One must consider that in complex disorders with multiple interacting genes, 
the prevalence of the susceptibility alleles may be quite high. Recent examples of 
this include the e4 allele of apoliprotein E in Alzheimer's disease, NOD-2 gene 
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changes in Crohn's disease, and calpain-10 gene changes and type 2 diabetes 
mellitus (30-32). In the case of autism, with the strongest evidence for numerous 
interacting genes, the expectation would be that the susceptibility variants for at 
least some loci will be quite common, and might even contribute to behavioral 
5 variability in healthy individuals. However, in any such study the true susceptibility 
locus or functional polymorphisms may not have been identified but rather are in 
linkage disequilibrium with the variants studied. The true susceptibility locus may 
even be in neighboring genes. In our studies, we have examined both flanking 
genes. For HAT1, we did not find any useful polymorphisms. For DNCI2, we found 

10 8 polymorphisms that were negative for association in first stage analyses. 

In our studies, we identified intronic polymorphisms associated with disease. 
The functional relevance of such variants remains obscure in most studies. 
However, it is increasingly being realized that the expression of a significant 
number of genes is regulated by cis-acting elements and that inherited variation in 

15 gene expression may contribute to disease (33, 34). Regulation of gene expression 
would affect cellular function, without requiring a modification in the coding 
sequence, if levels of the gene product were limiting. It has recently been 
demonstrated that over-expression of AGC1 can lead to increased mitochondrial 
ATP production (35), so genetic variations that change the expression of AGC1 

20 would be predicted to impact on ATP production. Whether the polymorphisms we 
identified (or additional polymorphism in linkage disequilibrium with the 
polymorphism we identified) affect gene expression needs to be determined. 

With all association studies, especially in complex disorders thought to be 
due to multiple interacting genes of weak effect, we must await replication in 

25 independent samples before the results can be accepted. However, given the 
evidence from our study, it may be unrealistic to expect that the finding will be 
easily replicated by TDT in a typical sample of under 200 trios. The TDT allows for 
robust statistical analysis without bias of population admixture. However, it lacks 
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power, especially at loci such as ours, where many of the parents were homozygous 
at the two loci. A carefully designed case-control study, with controls matched for 
ethnicity, gender, and age, may have more power to detect association at these two 
loci. Alternatively, genotyping several hundred trios for TDT would be in order for 
5 a replication study. 

Assuming that our results reflect true association of SLC25A12 with autism, 
the data indicate a genotype relative risk for the two-loci of between 2 and 5 (FIG. 
2). This, while significant, must be taken in context of ±e observation that the 
susceptibility variants (or the variants in linkage disequilibrium with the true 
10 susceptibility variants) are common alleles. This is consistent with the idea that this 
locus plays a significant role in the epidemiology of the disorder but would not be 
immediately useful for genetic counseling, until it can be considered together with 
additional loci. 

To further investigate SLC25A12 locus as a susceptibility gene for autism we 
15 are currently searching for additional genetic markers across the 110 kb region 
containing this gene, using both extant databases and sequencing in the patients 
we are studying, particularly of conserved non-coding regions. To date, more than 
90 SNPs encompassing the SLC25A12 gene have been identified in the public 
databases, with at least 20 SNPs harboring a heterozygosity rate over 0.1. None of 
20 these appear in conserved non-coding regions. We are also carrying out expression 
studies of AGC1. 

In view of the above, it will be seen that the several advantages of the 
invention are achieved and other advantages attained. 

As various changes could be made in the above methods and compositions 
25 without departing from the scope of the invention, it is intended that all matter 
contained in the above description and shown in the accompanying drawings shall 
be interpreted as illustrative and not in a limiting sense. 
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All references cited in this specification are hereby incorporated by 
reference. The discussion of the references herein is intended merely to summarize 
the assertions made by the authors and no admission is made that any reference 
constitutes prior art. Applicants reserve the right to challenge the accuracy and 
5 pertinence of the cited references. 
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Appendix - SEQ ID Nos 

SEQ ID NO:l region around rs2056262 that is amplified with SEQ ID NO:5 and 
SEQ ID NO: 6 - with A form; the A at the polymorphic site is underlined 
GTTACCCTGAGCTACAGTTTATATAGTAAAGG(^GATTAAATGCCTGATGCTTTCCC 
5 TTTCTTCACCAGCCAATGACGTTTTATATTTTATTC 

TTTTTGGCATTTGAATCTGTTTTATGTGCTCCAGATTCC^^ 

AGTTGTTTGACAAGAGTGGAAATGGAGAGGTGACATTTGGTAAGGGAAAAAGAAGA 
TTATAAGTGATAAGTTAATGATGCTGGTCCAGTCTTCAATTGCTGAATCTAGTAACT 
AATATAGATTACTGCTTATTTGGGATCTGA 

10 SEQ ID NO:2 region around rs2056262 that is amplified with SEQ ID NO:5 and 
SEQ ID NO:6 - with G form; the G at the polymorphic site is underlined 
GTTACCCTGAGCTACAGTTTATATAGTAAAGGCAGATTAAATGCCTGATGCTTTCCC 
TTTCTTCACCAGCCAATGGCGTTTTATATTTTATTCCAGGTTGATCT^ 
TTTTTGGCATTTGAATCTGTTTTATGTGCTCCAGATTCCATGTTC^TAG 

1 5 AGTTGTTTGACAAGAGTGGAAATGGAGAGGTGACATTTGGTAAGGGAAAAAGAAGA 
TTATAAGTGATAAGTTAATGATGCTGGTCCAGTCTTCAATTGCTGAATCTAGTAACT 
AATATAGATTACTGCTTATTTGGGATCTGA 

SEQ ID NO:3 region around rs2292813 that is amplified with SEQ ID NO:7 and 
SEQ ID NO: 8 - with A form; the A at the polymorphic site is underlined 

20 CCGCTCAAGTGGTTGAAGTTGCAGGCTCTATAGTAGATGTTAGCAGTATTCCTATCT 
TTTATAAGGTACTCTAGATAAATTAAATGTGGTTTTCTCCTGAAAGGTGTCCCAGCT 
GCATCTCTGGTGACCCCTGCTGATGTCATCAAGACAAGACTGCAGGTGGCTGCCCG 
CGCTGGCCAGACGA(^TACAGTGGTGTCATCGACTGTTTCAGGAAGATTCTCCGGG 
AAGAAGGGCCCTCAGCATTTTGGAAAGGGACTGCAGGTAGGCAGGGGCTGGAGCCA 

25 TACAGAATGGCTGGCTGGCTCTAGCGTCCTCCCCTGTGACTCAGTGGCTATCTTTAC 
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(^(^TTTGTCCTGGTTT(^GTCTCCCCrGCCCCTGCTTCTCTTTTTCAGCTCGAGTG 
TTTCGATCCTCTCCCCAGTTTGGTGTTACCTTGGTCACTTATGAACTTCTCCAGCGG 
TGGTTTTACATTGATTTTGGAGGCCTGTAAGTCAGCTGCTCAACTCCTTTACAAAGA 
AATCACTAAGTCCAAAACAAATGTTTGTTCTGTCTACAAAAGCATTGTTGCAACTCTT 
5 AGAAAACTGATAAGACAGAACCTTTAAGACCAATGC 

SEQ ID NO:4 region around rs2292813 that is amplified with SEQ ID NO:7 and 
SEQ ID NO: 8 - with G form; the G at the polymorphic site is underlined 
CCGCTCAAGTGGTTGAAGTTGCAGGCTCTATAGTAGATGTTAGCAGTATTCCTATCT 
TTTATAAGGTACTCTAGATAAATTAAATGTGGTTTTCTCCTGAAAGGTGTCCCAGCT 

10 GCATCTCTGGTGACCCCTGCTGATGTCATCAAGACAAGACTGCAGGTGGCTGCCCG 
CGCTGGCCAGACGACATACAGTGGTGTCATCGACTGTTTCAGGAAGATTCTCCGGG 
AAGAAGGGCCCTCAGCATTTTGGAAAGGGACTGCAGGTAGGCAGGGGCTGGAGCCA 
TACAGAATGGCTGGCTGGCTCTAGCGTCCTCCCCTGTGACTCAGTGGCTGTCTTTAC 
CACATTTGTCCTGGTTTCAAGTCTCCCCTGCCCCTGCTTCTCTTTTTCAGCTCGAGTG 

1 5 TTTCGATCCTCTCCCCAGTTTGGTGTTACCTTGGTCACTTATGAACTTCTCCAGCGG 
TGGTTTTACATTGATTTTGGAGGCCTGTAAGTCAGCTGCTCAACTCCTTTACAAAGA 
AATCACTAAGTCCAAAACAAATGTTTGTTCTGTCTACAAAAGCATTGTTGCAACTCTT 
AGAAAACTGATAAGACAGAACCTTTAAGACCAATGC 

SEQ ID NO:5 forward primer for amplifying rs2056202 
20 GTTACCCTGAGCTACAGTT 

SEQ ID NO:6 reverse primer for amplifying rs2056202 
TCAGATCCCAAATAAGCAG 

SEQ ID NO:7 forward primer for amplifying rs2292813 
CCGCTCAAGTGGTTGAAGTT 
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SEQ ID NO: 8 reverse primer for amplifying rs2292813 
GCATTG GTCTTAAAG GTTCTGTCT 
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What is claimed is: 

1. A method of evaluating an individual for relative genetic risk for autism, 
the method comprising determining the individual's genotype at polymorphism 
sites rs2056202 and/or rs2292813 of the SLC25A12 gene, wherein the presence of 

5 a G at either of the two sites indicates an increased risk for autism, and the 

presence of an increasing number of G's at the sites indicates an increasing risk for 
autism. 

2. The method of claim 1, wherein the genotype are determined by one or 
more methods selected from the group consisting of single strand conformation 

10 polymorphism, denaturing high-performance liquid chromatography, DNA Invader, 
and polymerase chain reaction amplification followed by sequencing. 

3. The method of claim 1, using polymerase chain reaction amplification 
with at least one primer comprising a sequence selected from the group consisting 
of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8. 

15 4. A set of two primers suitable for use in polymerase chain reaction, one 

primer comprising the sequence of SEQ ID NO:5 and the other primer comprising 
the sequence of SEQ ID NO:6. 

5. A set of two primers suitable for use in polymerase chain reaction, one 
primer comprising the sequence of SEQ ID NO: 7 and the other primer comprising 

20 the sequence of SEQ ID NO:8. 

6. A kit comprising at least one set of primers suitable for use in polymerase 
chain reaction (PGR), wherein the set of primers amplifies polymorphism site 
rs2056202 of the SLC25A12 gene. 
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7. A kit comprising at least one set of primers suitable for use in polymerase 
chain reaction (PCR), wherein the set of primers amplifies polymorphism site 
rs2292813 of the SLC25A12 gene. 

8. A kit comprising at least two sets of primers suitable for use in 
5 polymerase chain reaction (PCR), wherein one set of primers amplifies 

polymorphism site rs2056202 of the SLC25A12 gene ancLthe second set of primers 
amplifies polymorphism site rs2292813 of the SLC25A12 gene. 

9. The kit of claim 8, wherein one set of primers consists of a primer 
comprising the sequence of SEQ ID NO:5 and a primer comprising the sequence of 

10 SEQ ID NO:6, and the second set of primers consists of a primer comprising the 
sequence of SEQ ID NO:7 and a primer comprising ±e sequence of SEQ ID NO:8. 

10. The kit of any one of claims 6-9, further comprising instructions for 
using the two sets of primers to evaluate an individual for relative genetic risk for 
autism by determining the genotype of the' polymorphic sites re2056202 and/or 

15 re2292813 of the SLC25A12 gene. 

9. A polynucleotide consisting of the sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. 

10. A method of identifying a form of a genetic polymorphism that is linked 
to autism, the method comprising identifying a polymorphism in the SLC25A12 

20 gene and determining whether one form of the polymorphism is present in autistic 
individuals more than another form, wherein the form that is present more often in 
autism is linked to autism. 
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11. Use of a genetic polymorphism linked to autism identified by the 
method of claim 10 for evaluating an individual for relative genetic risk for autism. 

12. A eukaryotic cell comprising a transgenic human SLC25A12 gene. 

13. The eukaryotic cell of claim 12, wherein the transgenic SLC25A12 gene 
5 comprises the sequence of SEQ ID NO:2 and/or SEQ ID NO:4. 

14. The eukaryotic cell of claim 12, wherein the cell is a yeast cell. 

15. The eukaryotic cell of claim 12, wherein the cell is a mammalian cell. 

16. The eukaryotic cell of claim 12, wherein the cell is a brain cell. 

17. The mammalian cell of claim 15, wherein the cell is in a living mammal. 
10 18. A non-human animal comprising the eukaryotic cell of claim 12. 

19. The non-human animal of claim 18, wherein the animal is a mammal. 

20. A method of evaluating whether a compound affects autism, the method 
comprising contacting the compound with the eukaryotic cell of claim 12 and 
determining whether the compound affects expression or activity of a product of 

15 the SLC25A12 gene, wherein a compound that affects expression or activity of the 
product of the SLC25A12 gene affects autism. 
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Abstract 

Methods and compositions are provided for evaluating an individual for 
relative genetic risk for autism, for identifying a form of a genetic polymorphism 
that is linked to autism, and for evaluating whether a compound affects autism. 
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